How to estimate the number of self-avoiding walks over $10^{100}$?

Use random walks.
Counting the number of $N$-step self-avoiding walks (SAWs) on a lattice is one of the most difficult problems of
enumerative combinatorics. Once we give up calculating the exact number of them, however, we have a chance to
apply powerful computational methods of statistical mechanics to this problem. In this paper, we develop a statistical
enumeration method for SAWs using the multicanonical Monte Carlo method. A key part of this method is to expand
the configuration space of SAWs to random walks, the exact number of which is known. Using this method, we
estimate the number of $N$-step SAWs on a square lattice, $c_N$, up to $N = 256$. The value of $c_{256}$ is
$5.6(1) \times 10^{108}$ (the number in the parentheses is the statistical error of the last digit) and this is larger
than one googol ($10^{100}$).

KEYWORDS: Self-avoiding walks, statistical enumeration, multicanonical Monte Carlo method, rare 
event sampling 
1 Introduction

Starting from the origin of a lattice and connecting a line to one neighboring site after another, we get a path. If we
choose a random direction for each step, the path becomes a random walk. The first question we ask is "How
many $N$-step random walks are there on a square lattice?" Since there are four possible directions for each step, the
answer is $4^N$. If we forbid the path from visiting the same site more than once, we get a self-avoiding walk (SAW)
instead of a random walk. We then ask a similar question: "How many $N$-step SAWs are there on a square lattice?"
The answer, however, is not as simple as for random walks. The number of SAWs, $c_N$, is not known for large values
of $N$. There are some sophisticated enumeration methods for SAWs, and exact values of $c_N$ on a square lattice are
known only up to $N = 71$ [1]. The enumerated $c_N$ for $N = 71$ is

$$c_{71} = 4\,190\,893\,020\,903\,935\,054\,619\,120\,005\,916 \approx 4.19 \times 10^{30},$$



which is much larger than Avogadro's constant. In order to go further, we change the problem from calculating all digits
of $c_N$ to estimating the first two or three digits of $c_N$. How large a $c_N$ can we estimate? Here, we set a
tentative goal of a googol ($10^{100}$).

One popular strategy in enumeration is divide and conquer. We propose, however, a different strategy to estimate a
large $c_N$, in which we expand the configuration space of SAWs to that of random walks and use knowledge of the number
of random walks. In order to explain how to use this exact number, we describe a statistical method of estimating the
area of a unit circle using Monte Carlo integration as an example. In Fig. 1 (a), we show a unit circle surrounded by a
square, the area of which is exactly 4. To calculate the area of the circle, we randomly generate $n$ points on the square
using pairs of real random numbers $(x, y) = (r_1, r_2)$ with $-1 < r_1, r_2 < 1$ and count the number of pairs which
satisfy the condition $x^2 + y^2 < 1$. Let this number be $n_c$. Using $n_c$ and $n$, we can estimate the area ratio of
the circle to the square. Since we know that the area of the square is 4, we can finally estimate the area of the unit
circle as $4 \times n_c / n$. Let us apply this logic to the estimation of $c_N$.
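The Monte Carlo integration described above can be sketched in a few lines of Python. This is only an illustration of the counting argument; the function name `estimate_unit_circle_area` and the fixed seed are assumptions made for this example and do not come from the paper.

```python
import random


def estimate_unit_circle_area(n, seed=0):
    """Estimate the area of the unit circle from n random points in the square [-1, 1]^2."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n_c = 0  # number of points that fall inside the circle
    for _ in range(n):
        r1 = rng.uniform(-1.0, 1.0)
        r2 = rng.uniform(-1.0, 1.0)
        if r1 * r1 + r2 * r2 < 1.0:
            n_c += 1
    # The square has area 4, so the circle area is 4 times the hit ratio.
    return 4.0 * n_c / n


if __name__ == "__main__":
    print(estimate_unit_circle_area(200_000))
```

The estimate fluctuates around the true area with a statistical error that shrinks as $1/\sqrt{n}$, which is the same behavior one should expect when the square is replaced by the set of random walks and the circle by the set of SAWs.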

In Fig. 1 (b), we show a Venn diagram of SAWs and random walks. The number of $N$-step random walks on a
$d$-dimensional hypercubic lattice is exactly known as $(2d)^N$. Instead of generating random points as in the above
example, we use Markov chain Monte Carlo (MCMC) to calculate the ratio of the number of SAWs to that of random walks.
Once we obtain the ratio, we can estimate $c_N$ by multiplying the ratio by $(2d)^N$.
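For short walks, the ratio can also be estimated by plain random sampling, in direct analogy with the circle example. The sketch below does this on the square lattice ($d = 2$). It is not the MCMC method of this paper, only a baseline to make the relation $c_N \approx (2d)^N \times \text{ratio}$ concrete; the function name `simple_sampling_estimate` is hypothetical.

```python
import random

STEPS = ((1, 0), (-1, 0), (0, 1), (0, -1))  # four directions on the square lattice


def simple_sampling_estimate(n, samples, seed=0):
    """Estimate c_N as 4^N times the fraction of random walks that are self-avoiding."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(samples):
        x = y = 0
        visited = {(0, 0)}
        self_avoiding = True
        for _ in range(n):
            dx, dy = rng.choice(STEPS)
            x, y = x + dx, y + dy
            if (x, y) in visited:
                # The walk has revisited a site, so it cannot be a SAW;
                # the remaining steps do not change that.
                self_avoiding = False
                break
            visited.add((x, y))
        hits += self_avoiding
    return 4**n * hits / samples


if __name__ == "__main__":
    # The exact value for N = 4 is 100.
    print(simple_sampling_estimate(4, 100_000))
```

This baseline works only while SAWs are a sizeable fraction of all random walks, which stops being true quickly as $N$ grows.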
Fig. 1: (a) Estimating the area of the unit circle using Monte Carlo integration. The values in the parentheses are the
areas of the shapes. 79 of 100 points were plotted inside the circle (pink dots) and the area of the circle can be
estimated as $4 \times 79/100 = 3.16$. (b) A Venn diagram of SAWs and random walks. In order to calculate $c_N$, we
expand the configuration space of SAWs to random walks, the number of which is exactly $(2d)^N$.



2 The statistical enumeration method for self-avoiding walks 

2.1 SAWs are rare random walks 

We define a self-avoiding walk (SAW) on a $d$-dimensional hypercubic lattice and its mathematical notation. We
denote an $N$-step path as $\omega = (\omega(0), \omega(1), \ldots, \omega(N))$, where each $\omega(i)$
($i = 0, 1, \ldots, N$) denotes a point which is randomly selected from the lattice. When we impose two conditions on
$\omega$, namely (i) $\omega(0)$ is at the origin of the lattice, and (ii) $|\omega(i+1) - \omega(i)| = 1$
($i = 0, 1, \ldots, N-1$), $\omega$ becomes an $N$-step random walk. If we add one more condition on $\omega$,
(iii) $\omega(i) \neq \omega(j)$ ($i \neq j$), $\omega$ becomes an $N$-step SAW [2]. It is obvious from the definition
of a SAW that the set of $N$-step SAWs is a subset of that of $N$-step random walks. SAWs are known as the $n \to 0$
limit of the $n$-vector model and have attracted great interest from physicists in the context of critical phenomena.
As we mentioned above, we expand the configuration space of SAWs to that of random walks and try to sample SAWs
using MCMC.
Fig. 2: A 12-step random walk with three intersections. There is one intersection on the purple circle and there are two
intersections on the orange circle. The total number of intersections $V$ of the path is 3.



Before we move on to the sampling method, we would like to discuss how rare SAWs are. The asymptotic behavior
of $c_N$ is conjectured to be

$$c_N = A \mu^N N^{\gamma - 1} [1 + o(1)], \tag{2.1}$$



where $\mu$ is called the connective constant, which is known to be $\approx 2.64$ [1], and $\gamma$ is a critical
exponent, which is thought to be universal. Namely, it depends only on the spatial dimensionality and is independent of
the lattice type. In order to roughly estimate the ratio of the number of SAWs to that of random walks on a square
lattice, we use the approximation

$$\frac{c_N}{(2d)^N} \approx \left(\frac{\mu}{2d}\right)^N.$$

In the case of $N = 50$, 150 and 250, the ratios are

$$\left(\frac{2.64}{4}\right)^{50} \approx 10^{-9}, \quad \left(\frac{2.64}{4}\right)^{150} \approx 10^{-27}, \quad \left(\frac{2.64}{4}\right)^{250} \approx 10^{-45}. \tag{2.2}$$

Fig. 3: A procedure of calculating $c(V)$. $c(V)$ is the number of random walks with $V$ intersections, and is
calculated by $c(V) = (2d)^N \Omega(V) / \sum_{V'=0}^{N-1} \Omega(V')$.

Even with $N = 50$, SAWs are very rare: if we generate random walks at random, we need about $10^9$ samples to
get one SAW. In the case of $N = 250$, SAWs become extremely rare and we cannot get even one SAW by random sampling,
even if we use the massive computer resources available today. Thus, we need a more efficient sampling method.
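The orders of magnitude quoted for these ratios follow from taking the base-10 logarithm of $(\mu/2d)^N$ with $\mu \approx 2.64$ and $2d = 4$. A short check, with the helper name `log10_saw_fraction` chosen only for this illustration:

```python
import math

MU = 2.64  # approximate connective constant of the square lattice, as quoted in the text


def log10_saw_fraction(n, mu=MU, coordination=4):
    """Return log10 of the rough SAW fraction (mu / coordination)^n."""
    return n * math.log10(mu / coordination)


if __name__ == "__main__":
    for n in (50, 150, 250):
        print(n, round(log10_saw_fraction(n), 2))
```

Since the prefactor $A N^{\gamma - 1}$ of Eq. 2.1 is dropped, these numbers are only rough estimates of the true fraction.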

2.2 Dividing the configuration space of random walks by the number of intersections 

As a first step towards an efficient sampling method, we introduce the number of intersections $V$ as a parameter. In
Fig. 2, we show a configuration of a random walk with three intersections, as an example. If $k$ points of a path
($k \geq 2$) are on the same site, we define that there are $k - 1$ intersections on that site. We divide the
configuration space into $N$ parts ($V = 0, 1, 2, \ldots, N-1$) according to the value of $V$, and configurations with
$V = 0$ are all SAWs. If we take $V$ as the energy of a random walk, we get the modified Domb-Joyce model [3,4]. In the
high- and low-temperature limits, this model reduces to random walks and SAWs, respectively. From the discussion above,
we need the ratio of the number of SAWs to that of random walks to estimate $c_N$. If we denote the density of states of
the modified Domb-Joyce model as $\Omega(V)$, then $\Omega(0) / \sum_{V=0}^{N-1} \Omega(V)$ gives the desired ratio,
and we can calculate $c_N$ by



$$c_N = (2d)^N \frac{\Omega(0)}{\sum_{V=0}^{N-1} \Omega(V)}. \tag{2.3}$$
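For very small $N$, the density of states $\Omega(V)$ can be obtained by brute-force enumeration of all $4^N$ random walks on the square lattice, which makes the decomposition by $V$ and Eq. 2.3 easy to check. The sketch below counts $V$ as the number of points on the path minus the number of distinct sites visited, which is equivalent to summing $k - 1$ over all sites. The function names are hypothetical, and this enumeration is only an illustration, not the sampling method of this paper.

```python
from itertools import product

STEPS = ((1, 0), (-1, 0), (0, 1), (0, -1))  # square lattice, d = 2


def count_intersections(dirs):
    """V = (N + 1 points on the path) - (number of distinct sites visited)."""
    x = y = 0
    visited = {(0, 0)}
    for d in dirs:
        dx, dy = STEPS[d]
        x, y = x + dx, y + dy
        visited.add((x, y))
    return len(dirs) + 1 - len(visited)


def density_of_states(n):
    """Exact Omega(V) for V = 0, ..., n-1 by enumerating all 4^n walks."""
    omega = [0] * n
    for dirs in product(range(4), repeat=n):
        omega[count_intersections(dirs)] += 1
    return omega


def saw_count(n):
    """c_N from Eq. 2.3; only the ratios of Omega(V) matter."""
    omega = density_of_states(n)
    return 4**n * omega[0] / sum(omega)


if __name__ == "__main__":
    print(density_of_states(4))
    print(saw_count(8))
```

Because the cost grows as $4^N$, this check is limited to roughly $N \lesssim 10$ in practice.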



In order to calculate $c_N$ accurately, we should ensure the accuracy of not only $\Omega(0)$ but also $\Omega(V)$ for
the entire range of $V$. It is difficult to do so, however, because the value of $\Omega(V)$ varies across many orders of
magnitude. To overcome this difficulty, we use a powerful computational method of statistical mechanics.

2.3 Multicanonical simulation of the modified Domb-Joyce model 

In the next step, we explain how to sample SAWs. We use the multicanonical Monte Carlo method [5,6] to accurately
estimate the density of states of the modified Domb-Joyce model, $\Omega(V)$, over a wide range of $V$. This
calculation is essentially the same as that of the multi-self-overlap ensemble [7,8] without Hamiltonian. A key to this
method is the weight function $W(V)$, which is built up by iterative methods to be roughly proportional to
$1/\Omega(V)$. Frequently used iterative methods are as follows:

• the multicanonical method [5,6]

• the entropic sampling method [9]

• the Wang-Landau method [10,11].



We used the Wang-Landau method. We omit the explanation of these iterative methods here and explain the multicanonical
method, assuming that $W(V)$ has already been obtained. In the multicanonical method, the transition probability from
one path $\omega_a$ ($V = V_a$) to another path $\omega_b$ ($V = V_b$) is given by



$$p(\omega_a \to \omega_b) = \min\left[1, \frac{W(V_b)}{W(V_a)}\right], \tag{2.4}$$



which has the same form as the transition probability of the Metropolis method, with $W(V)$ in place of the
Boltzmann factor. An MCMC simulation using this transition probability generates an ensemble called a multicanonical
ensemble. Sampling from this ensemble gives a flat histogram $H(V)$ (in the top left graph of Fig. 3). Dividing $H(V)$ by
$W(V)$, we obtain $\Omega(V)$ up to a normalization constant (in the bottom left graph of Fig. 3). Since Eq. 2.3 involves
only ratios of $\Omega(V)$, we can finally calculate $c_N$ from the obtained $\Omega(V)$ (in the bottom right graph of
Fig. 3).
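The whole procedure can be illustrated with a small Python sketch for short walks on the square lattice. Several choices here are assumptions made for the example and are not taken from the paper: the proposal move (redrawing the direction of one randomly chosen step, which is symmetric and can reach every random walk), the fixed Wang-Landau schedule without a flatness check, the run lengths, and all function names.

```python
import math
import random

STEPS = ((1, 0), (-1, 0), (0, 1), (0, -1))  # square lattice, d = 2


def count_intersections(dirs):
    """V = (points on the path) - (distinct sites visited)."""
    x = y = 0
    visited = {(0, 0)}
    for d in dirs:
        dx, dy = STEPS[d]
        x, y = x + dx, y + dy
        visited.add((x, y))
    return len(dirs) + 1 - len(visited)


def metropolis_step(dirs, v, log_w, rng):
    """One move accepted with probability min[1, W(V_b)/W(V_a)] (Eq. 2.4)."""
    i = rng.randrange(len(dirs))
    old = dirs[i]
    dirs[i] = rng.randrange(4)  # assumed proposal: redraw one step direction
    v_new = count_intersections(dirs)
    if rng.random() < math.exp(min(0.0, log_w[v_new] - log_w[v])):
        return v_new  # accept the new path
    dirs[i] = old  # reject: restore the old path
    return v


def estimate_saw_count(n, seed=0, stages=16, moves_per_stage=10_000,
                       production=500_000):
    rng = random.Random(seed)
    dirs = [rng.randrange(4) for _ in range(n)]
    v = count_intersections(dirs)
    log_w = [0.0] * n  # ln W(V) for V = 0, ..., n-1

    # Stage 1: Wang-Landau-style iteration that drives ln W(V) towards
    # -ln Omega(V). A fixed halving schedule is used here for simplicity.
    log_f = 1.0
    for _ in range(stages):
        for _ in range(moves_per_stage):
            v = metropolis_step(dirs, v, log_w, rng)
            log_w[v] -= log_f
        log_f /= 2.0

    # Stage 2: multicanonical production run with W(V) held fixed.
    hist = [0] * n
    for _ in range(production):
        v = metropolis_step(dirs, v, log_w, rng)
        hist[v] += 1

    # Omega(V) is proportional to H(V) / W(V); work with logarithms
    # and subtract the maximum to avoid overflow.
    log_omega = [math.log(h) - lw if h > 0 else -math.inf
                 for h, lw in zip(hist, log_w)]
    shift = max(log_omega)
    omega = [math.exp(lo - shift) for lo in log_omega]
    return 4**n * omega[0] / sum(omega)  # Eq. 2.3 with d = 2


if __name__ == "__main__":
    # The exact value for N = 8 is 5916.
    print(estimate_saw_count(8))
```

Because the production run uses fixed weights, the estimate of $\Omega(V)$ from $H(V)/W(V)$ does not rely on the weights being perfectly converged; imperfect weights only make the histogram less flat and the statistics less efficient.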

3 Results 

Using the method proposed above, we calculate $c_N$ with a statistical error up to $N = 256$. Some of the calculated
$c_N$ values are shown in Table 1, together with the available exact values ($N \leq 71$) and the estimated values given
by Prellberg and Krawczyk using flatPERM [12]. The errors were estimated from the standard errors of histograms. The
estimated values agree well with the exact values and with the flatPERM estimates. We should note that flatPERM
achieved longer walks up to $c_{1028} = 1.74 \times 10^{438}$, but due to the lack of error estimation, it is not known
how many digits of the values are reliable. The largest value of our estimation is

$$c_{256} = 5.6(1) \times 10^{108},$$

and we achieved the goal of a $c_N$ value of over $10^{100}$.



Table 1: List of exact and estimated $c_N$ values on a square lattice

N      $c_N$ (exact)              $c_N$ (flatPERM [12])       $c_N$ (our method)
4      100                        -                           $1.0001(2) \times 10^{2}$
8      5916                       -                           $5.915(2) \times 10^{3}$
16     $1.7245 \times 10^{7}$     $1.7265 \times 10^{7}$      $1.724(1) \times 10^{7}$
32     $1.1903 \times 10^{14}$    $1.1911 \times 10^{14}$     $1.186(2) \times 10^{14}$
64     $4.5493 \times 10^{27}$    $4.5529 \times 10^{27}$     $4.54(2) \times 10^{27}$
128    -                          $5.2970 \times 10^{54}$     $5.32(6) \times 10^{54}$
256    -                          $5.6700 \times 10^{108}$    $5.6(1) \times 10^{108}$